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ABSTRACT 

The present study deals with application of time-series modelling in agriculture. The mustard yield data of Hisar, 
Bhiwani, Sirsa, Mahendergarh and Gurgaon districts of Haryana have been considered for this empirical study. 
Autoregressive integrated moving average (ARIMA) models have been fitted using the district-level time series yield(s) 
from 1966-67 to 2010-11 and the models have been validated for the post-sample period(s) 2011-12 to 2013-14. After 
experimenting with different lags of the moving average and autoregressive processes; ARIMA(0,1,1) for Hisar, Bhiwani 
and Sirsa districts and ARIMA( 1,1,0) for Mahendergarh and Gurgaon districts have been fitted for mustard yield(s) 
estimation in Haryana. A perusal of the results indicates that the percent deviations of the forecast yield(s) from the 
observed yield(s) are within acceptable limits and favours the use of ARIMA modelling to get short-term forecast 
estimates. 
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INTRODUCTION 

Regression analysis is the most frequently used statistical technique for investigating and modelling the 
relationship between variables. However, some applications of regression involve regressor and response variables that 
have a natural sequential order over time and then the need of time series modelling arises for the analysis of such 
dependence. Time series models have advantages in certain situations. They can be used more easily for forecasting 
purposes because the historical sequences of observations upon study variables are readily available at equally spaced 
intervals over discrete point of time. These successive observations are statistically dependent and time series modelling is 
concerned with the analysis of such dependence. The application of the Box-Jenkins (1976) univariate ARIMA models in 
the field of agriculture for forecasting a variety of study variables of interest for different crops / regions etc. may be of 
immense importance. 

The importance of agriculture for the Indian society can hardly be over emphasized, as its role in economy, 
employment, food security, self-reliance and general well-being does not need reiteration. India has a very well established 
system for collection of crop statistics at village level and aggregating it at different administrative levels. However, the 
need for early and in-season crop production forecasting has been strongly felt. Fulfilling this requirement entails judicious 
planning based on information related to various aspects of agriculture. Information on crop acreage and production are 
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important inputs for strategic planning. 

Panse (1952, 59, 64) in a series of papers studied the trends in yields of rice and wheat with a view to compare the 
yield rates during the plan periods with that of the pre-plan periods. Just to cite a few; Balanagammal et al. (2000) 
obtained forecast values by using ARIMA models for the data related to cultivable area, production and productivity of 
chosen crops in Tamil Nadu, India. Verma and Grover (2006) developed ARIMA models for wheat yield forecasting in all 
the districts of Haryana state (India). Further, a comparison was shown with remote sensing based yield forecasts and real 
time yield data as well. Awal and Siddique (2011) applied the ARIMA models to efficiently forecasting Aus, Aman and 
Boro rice production in Bangladesh. 

Rapeseed-mustard is the third important oilseed crop in the world after soybean and palm oil. India is one of the 
largest rapeseed-mustard growing countries in the world, occupying the first position in area and third position in 
production after the EU27 and China, and contributing around 11% of the world’s total production. Among the seven 
edible oilseeds cultivated in India, rapeseed-mustard (Brassica spp.) contributes 28.6% in the total production of oilseeds. 
It is the second most important edible oilseed after groundnut sharing 27.8% in the India’s oilseed economy. Indian 
mustard is predominantly cultivated in Rajasthan, UP, Haryana, Madhya Pradesh and Gujarat and collectively represent 81 
per cent of the national acreage and contribute 82.9 per cent to the total rapeseed-mustard production. Rapeseed is grown 
on nearly 13% of the cropped land in India. It is basically a winter (i.e. the rabi season) crop and is grown from September- 
October to February-March. The crop grows well in areas receiving 25cm to 40cm of rainfall and this is provided by the 
monsoon rains during the sowing season of the crop in India. 

In view the importance of the subject matter, an attempt has been made to develop ARIMA models for mustard 
yield prediction in Hisar, Bhiwani, Sirsa, Mahendergarh and Gurgaon districts of Haryana. 

Data Description and Statistical Methodology 

The Haryana state comprising of 21 districts (geographical area: 44212 sq. km) is situated between 74° 25’ to 77° 
38’ E longitude and 27° 40’ to 30° 55’ N latitude. The time-series data of state Department of Agriculture (DO A) mustard 
yield spanning 1966-67 to 2013-14 (Source: esaharyana.gov.in/State Statistical Abstract/) were collected for ARIMA 
modeling. The emphasis has been given in predicting the future values on the basis of previous time-series observations. 
The district-level mustard yield from 1966-67(Hisar, Mahendergarh and Gurgaon)/ 1972-7 3 (Bhiwani)/ 1975-76(Sirsa) to 
2010-11 have been used for the training set and the remaining yield data i.e. 2011-12, 2012-13 and 2013-14 have been used 
for the post-sample validity checking of the developed ARIMA models. 

Box-Jenkins Arima Modeling Procedure 

Univariate Box-Jenkins(UBJ) ARIMA forecasts are based only on past values of the variable being forecast. The 
method applies to both discrete as well as to continuous data. However, the data should be available at equally spaced 
discrete time intervals. Before attempting to choose an appropriate ARIMA model for forecasting, it is necessary to make 
the data series stationary. One of the simplest transformations called ‘differencing’ is used when the mean of a series is 
changing over time and log transformation is used if the variance of a series is changing through time. The estimated 
autocorrelation function and partial autocorrelation function are very important tools at the identification stage. An 

estimated autocorrelation function r k shows the correlation between ordered pairs ( ^ ^ * +k ) separated by various time 
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spans ( k=l,2,3 , ). An estimated partial autocorrelation function ^ ^ shows the correlation between ordered pairs 

(Y i Y t ^ se p ara ted by various time spans (k = 1, 2, 3 ) with the effect of intervening observations ( Yt +u Yt +2? 

Y t + k _i) accounted for. 

The general functional form of ARIMA model used is : 

Autoregressive Integrated Moving Average model i.e. ARIMA (p,d,q) <f> p (B) A d Y t = c'+ 6 q (B) a t> where c' = 0 if Y t 
is adjusted for its mean 

Where Y = Variable under forecasting, B - lag operator, a - error term (Y- Y , where Y i s the estimated value of 
Y), t- time subscript, </> v (B) - non-seasonal AR component(s), (1-B) d - non-seasonal difference, 0 q (B) - non-seasonal MA 
component(s); ^’s and 0’s are the parameters need to be estimated 

Further, an attempt is made at the estimation stage to obtain the precise estimates of a small number of parameters 
of the model. Linear least-squares may be used to estimate only pure AR models. All other models require a non-linear 
least squares (NLS) method. Thirdly, the diagnostic tests are performed to see the random shocks to be independent or not. 

RESULTS AND DISCUSSION 

The ARIMA methodology has been applied for estimating mustard yield(s) in Hisar, Bhiwani, Sirsa, 
Mahendergarh and Gurgaon disticts of Haryana. UB J- identification involved the determination of the appropriate orders of 
AR and MA polynomials i.e. the values of p and q. Thus, the orders were determined from the autocorrelation functions 
and partial autocorrelation functions of the stationary series. 

The mustard yield(s) data were found to be non stationary for all the districts. Almost all the autocorrelations upto 
8/10 lags significantly different from zero shown in Figure 1 confirm non-stationarity. Differencing of order one was 
sufficient for getting an appropriate stationary series of all the districts however the log transformation was also applied for 
Hisar district. 
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Figure 1: Autocorrelation Coefficients of Mustard Yield for Different Districts of Haryana 


The models ARIMA (1,1,1), ARIMA (1,1,0) and ARIMA (0,1,1) were considered at the identification stage and 
parameter estimation was carried out using a non-linear least squares (NLS) approach. After experimenting with different 
lags of the moving average and autoregressive processes; ARIMA(0,1,1) for Hisar, Bhiwani and Sirsa districts and 
ARIMA( 1,1,0) for Mahendergarh and Gurgaon districts were fitted for estimating district-level mustard yield(s) in 
Haryana. Parameter estimates of the selected models shown in Table 1 are less than one (needed for convergence) and also 
satisfy the stationarity and invertibility conditions under ARIMA structure. 


Marquardt algorithm (1963) was used to minimize the sum of squared residuals. Log Likelihood, Akaike’s 
Information Criterion, AIC (1969), Schwarz’s Bayesian Criterion, SBC (1978) and residual variance decided to estimate 
AR and MA coefficients in the models. Approximate ‘t’ values were calculated for residual autocorrelation coefficients 
using Bartlett’s approximation for the standard error of the estimated autocorrelations. The residual acf along with the 
associated ‘t’ tests and Chi-squared test suggested by Ljung and Box (1978) were used for the checking of random shocks 
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to be white noise. All Chi-Squared statistics in this concern have been shown in Table 2. 

Table 1: Results on Stationarity and Invertibility Conditions for AR and MA Coefficients of Fitted Models 


Variable /Mustard Yield 

Model 

Stationarity 

Invertibility 

Hisar 

ARIMA (0, 1,1) 

* 

0.87 

Bhiwani 

ARIMA (0, 1,1) 

* 

0.75 

Sirsa 

ARIMA (0, 1,1) 

* 

0.84 

Mahendergarh 

ARIMA (1, 1,0) 

-0.63 

** 

Gurgaon 

ARIMA (1, 1,0) 

-0.38 

** 


* Stationarity condition is not applicable since the model is MA model 
** Invertibility condition is not applicable since the model is AR model 


It is clear from the above table that both the stationarity and invertibility conditions are satisfied because the 
absolute values of AR and MA coefficients for all the districts are less than one. 


Table 2: Diagnostic Checking of Residual Autocorrelations of Mustard Yield for all the Districts 


District(S) 

Model 

Model Fit Statistic 

Ljung-Box Q Statistic 

RMSE 

MAPE 

SBC 

Statistic 

Df 

Sig. 

Hisar 

ARIMA (0,1,1) 

2.55 

22.51 

2.05 

15.22 

17 

0.58 

Bhiwani 

ARIMA (0,1,1) 

2.27 

19.77 

1.84 

13.69 

17 

0.69 

Sirsa 

ARIMA (0,1,1) 

2.26 

20.32 

1.8 

18.71 

17 

0.35 

Mahendergarh 

ARIMA (1,1,0) 

3.04 

28.37 

2.39 

25.26 

17 

0.09 

Gurgaon 

ARIMA (1,1,0) 

2.46 

25.17 

1.99 

10.97 

17 

0.86 


Finally, a comparison between ARIMA model based mustard yield estimates with observed yield(s) was seen in 
terms of percent relative deviation (RD%). The results presented in Table 3 indicate that the deviations of predicted yield(s) 
from the actual yield(s) are within acceptable limits and thus favouring the use of ARIMA models to get short-term forecast 
estimates. 

Table 3: Comparison of ARIMA Model Based Mustard Yield Estimates with State Department of Agriculture 

Mustard Yield(s) in Different Districts of Haryana 


District(s) 

Forecast Years 

DOA Yield (q/ha) 

Estimated Yield (q/ha) 

RD(%) 

Hisar 

2011-12 

17.07 

16.66 

2.40 

2012-13 

16.78 

17.16 

-2.30 

2013-14 

16.26 

17.66 

-8.60 

Bhiwani 

2011-12 

14.00 

14.21 

-1.50 

2012-13 

16.40 

14.47 

11.77 

2013-14 

15.16 

14.72 

2.90 

Sirsa 

2011-12 

16.78 

16.00 

4.65 

2012-13 

16.47 

16.81 

-2.06 

2013-14 

17.37 

16.77 

3.45 

Gurgaon 

2011-12 

20.25 

18.67 

7.80 

2012-13 

20.26 

20.02 

1.18 

2013-14 

15.94 

18.24 

-14.43 

Mahendergarh 

2011-12 

18.27 

15.86 

13.19 

2012-13 

16.99 

17.36 

-2.18 

2013-14 

16.99 

16.93 

0.35 


RD(%) = {(Observed yield -Estimated yield)/ Observed yield} *100 
DOA = State Department of Agriculture yield estimates 
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